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Abstract. Motivated by applications to social network analysis (SNA), 
we study the problem of finding the maximum number of disjoint uni- 
color paths in an edge-colored graph. We show the NP-hardness and 
the approximability of the problem, and both approximation and exact 
algorithms are proposed. Since short paths are much more significant 
in SNA, we also study the length-bounded version of the problem, in 
which the lengths of paths are required to be upper bounded by a fixed 
integer I. It is shown that the problem can be solved in polynomial time 
for I = 3 and is NP-hard for / > 4. We also show that the problem can 
be approximated with ratio (J — l)/2 + e in polynomial time for any 
e > 0. Particularly, for I = 4, we develop an efficient 2-approximation 
algorithm. 
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1 Introduction 

A social network is usually modeled by a graph G = (V, E), in which V is 
the set of actors and EC. V x V is the binary relation we are interested in. 
In the terminology of graph theory, V is the node set and E is the edge set. 
The connectivity, or node connectivity, of two nodes is the minimum number of 
nodes whose removal separates the two nodes. By Menger's theory, it is equal 
to the maximum number of disjoint paths between the two nodes and also can 
be thought of as a simpler form of the maximum flow between them. In social 
network analysis (SNA), connectivity is a basic measurement of information flow 
between nodes and also used to define cohesion group and centralities [316116] . 
Thus computing the connectivity of two nodes is an important problem in SNA. 

When there are more than one kinds of relations, we can model a multi- 
relations social network by a graph with more than one edge sets. Let c be a 
positive integer. A c-relations social network can be described by G = (V, £), in 
which V is the set of nodes and £ = {Ei,E2, ■ . ■ , E c } is a collection of c edge 
sets. For 1 < i < c, Ei C V x V represents the j-th relation, and we shall say 
the edges in Ei are of color i. Note that there may be edges of different colors 
between one pair of nodes. For a fixed c, a graph is called as a c-colors graph if 



there arc at most c colored edge sets, and simply a "color graph" if the number 
of colors is not fixed or need not be specified. 

A path is of uni-color if all the edges of the path are of the same color. Two 
paths are internally disjoint if they have no common internal node, and a set 
of paths are internally disjoint if they are mutually internally disjoint. In this 
paper we shall simply use "disjoint" . The decision version of the main problem 
discussed in this paper is defined as follows. 

Problem: The disjoint paths problem on color graphs (CDP) 
Instance: A color graph G, two nodes s, t g V and a positive integer p. 
Question: Are there p disjoint uni-color paths from s to f? 

In general, the graph may be directed or undirected but in this paper we only 
consider undirected graphs. We shall use the name "CDP cp " for the decision 
problem of which the input is a c-colors graph. The maximization version, de- 
noted by Max CDP problem, asks for the maximum number of disjoint uni-color 
paths between two given nodes, which will be called as their colored connectiv- 
ity. When there is only one color, the maximum number of disjoint paths, i.e, 
the traditional connectivity, can be computed in polynomial time by solving 
the maximum flow problem. But the colored connectivity problem, to our best 
knowledge, has not been studied yet. A related but different problem studied in 
the literature is the minimum color path problem which is motivated by com- 
munication reliability and the goal is to find a path or two disjoint paths with 
minimum number of colors [11117] . Other related problems also includes the min- 
imum color-cost path problem |7j and properly colored path problems, seeing [5] 
for example. 

The motivation of studying the colored connectivity is natural. Most of the 
researches in SNA consider only single relation. But in practical there are more 
than one kinds of relations. The Max CDP problem arises if the information flow 
or the influence spread only along relations of the same kind. Computer virus 
spreading is an example. One virus usually spreads only along one or several 
particular computer softwares. Conversations among people is another example. 
People usually talk different topics with the ones of different relations. Disjoint 
paths also play an important role in data communication when security or traffic 
congestion is concerned. Thus the scenario of the Max CDP problem may also 
occur if different types of links between nodes are considered, either due to 
different media or different protocols. 

The results and the organization of this paper are as follows. In Section 2, 
first we show that the CDP problem is NP-complete even for 2-colors graphs and 
that the Max CDP problem cannot be approximated with ratio less than two, 
unless NP=P. And then we give an 0(mn)-time c-approximation algorithm for c- 
colors graphs. Throughout this paper, m and n denote the numbers of edges and 
nodes of the input graph G, respectively. An extreme example is given to show 
the tightness of the ratio. Also we give an 0((m + n)c n ) time exact algorithm for 
the problem. Since, in social network analysis, short paths are considered much 
more significant than long paths, we also study the length-bounded version of 
the Max CDP problem, namely Z-LCDP, in which the lengths of solution paths 



are required to be upper bounded by a fixed integer I. In Section 3, we show 
that the l-LCDP problem can be solved by graph matching for I = 3 and is 
NP-hard for I > 4. We also show that, for any fixed e > 0, the ^-LCDP problem 
can be approximated with ratio (I — l)/2 + e in polynomial time. Particularly, 
for a c-colors graph, we give an efficient 2-approximation for I = 4 with time 
complexity 0{p 2 {c 2 n + cm)), in which p is the number of paths found by the 
algorithm. In most of the applications, it is a linear time algorithm. 

2 Complexity and approximability 

In this section, we show the complexity and the approximability of the CDP 
problem. First, in Section 2.1, we show that the problem is NP-complete, and 
the proof also implies that the Max CDP problem is NP-hard and cannot be 
approximated with ratio less than two, unless NP=P. In Section 2.2, we give a 
simple c-approximation algorithm for c-colors graphs and an extreme example 
to show the sharpness of the ratio. In Section 12.31 we propose an algorithm for 
finding the exact solution. For a c-colors graph G — (V, {Ei, E2, ■ ■ ■ E c }), we 
shall denote {V, E t ) by G l . 

2.1 NP-completeness 

To show the NP-hardness of the CDP problem, we introduce the following similar 
problem, named MCDP C in short. 

Problem: The multi-pairs disjoint paths problem on c-colors graphs 
Instance: A c-colors graph G, c pairs (s;,ii), 1 < i < c, of nodes. 
Question: Is there a color-i path Pi from Si to ti for each 1 < i < c 
such that Pi and Pj are internally disjoint for all i and jl 

The reduction from the MCDP2 problem to the CDP2.2 problem is quite 
straightforward. We first assume that all nodes in the given pairs are distinct, and 
the other case will be explained later. For an instance of the MCDP2 problem, 
we construct a graph G' from G by adding two new nodes s and t, as well as four 
edges (s,si), (s,S2), (t,ti), and (£,£2)- The edges (s,si) and have color 

one and the other two new edges have color two. Apparently there exist two 
disjoint uni-color si-paths in G' if and only if the answer of the MCDP2 problem 
is also "yes" . Therefore if the MCDP2 problem is NP-complete, so is the CDP2.2 
problem. In the case that s\ — S2, we can add a duplicate s[ of si such that s[ 
has the same neighbors as s±, and the edges incident to s are (s, s\) and (s, s[) 
instead. Other cases that any two nodes in the given pairs are not distinct can 
also be handled similarly. We shall show the NP-completeness of the MCDP2 
problem by transformation from the SAT problem. We remind that the MCDP 
problem on 1-color graphs is polynomial-time solvable when the number of pairs 

is fixed una. 

Let Ci, 1 < i < q be the clauses of the SAT problem and Xi, 1 < i < r, the 
variables. We construct a 2-colors graph G = (V, {E\, E 2 }) as follows. The node 
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Fig. 1. Transformation from SAT problem to MCDP2 problem: (a), edges of 
color 1; and (b). edges of color 2. 

set mainly consists of {si,ti, 33, t2}U{x J i \x i e Cj, 1 < i < r, 1 < j < q}U{x\\xi € 
Cj, 1 < i < r, 1 < j < (?}, and some other nodes for some "switches" (explained 
later). The edges of color 1 and 2 are depicted in Figured] 

Gi is an (q + 2)-stages graph, in which the i-th stage corresponding to a 
clause Cj for 1 < i < q, and the 0-th and the (q + l)-th stages are s% and 
ti, respectively. Two consecutive stages are connected as a complete bipartite 
graph. Note that, for simplicity, the super scripts of nodes are not shown in the 
figure. Different nodes are used to represent a same literal X{ or xi appearing in 
different clauses. 

For color 2, all occurrences of a same literal, i.e, x\ or x\ for all j, are 
connected to form a path, and the four paths of two consecutive variables are 
connected by a 2 x 2 switch as shown in the figure. 

Lemma 1. If and only if there is a truth assignment satisfying all the clauses, 
there are an siti-path in Gi and an s^ti-path in Gi, which are disjoint. 

Proof. If the instance of SAT problem is satisfiable, we may have an S2^2-path 
in G2 passing through all literals which are assigned False. That is, for each i, 
the path passes through X{ if Xi =False; and through Xi otherwise. Since this 
truth assignment satisfies all clauses, each clause has a literal assigned True, and 
therefore there is a path from si to ti in G\. 

Conversely, suppose that there are two such disjoint paths. Since there is an 
si^i-path in G\, each stage has a node not used by the path in G2. We observe 
that, in G2, any S2t2-path passes through all occurrences of either Xi or X{ for 
every i. Therefore if we assign n True if it is not passed by the path in G2 and 
assign False otherwise, every clause has a literal assigned True and the instance 
is satisfiable. □ 



Fig. 2. A tight example for the c-approximation algorithm, in which edges are 
labeled by their color number. 

Since the MCDP 2 and the CDP problems are apparently in NP, we obtain the 
following theorem. 

Theorem 1. The MCDP 2 problem is NP-complete. The CDP problem is NP- 
complete even for determining if there exist 2 paths in a 2- colors graph. 

Corollary 1. The Max CDP problem is NP-hard and cannot be approximated 
in polynomial time with ratio 2 — e for any e > 0, unless NP=P. 

Proof. Since determining one or two paths is NP-complete, it is impossible to 
approximate the optimal with ratio less than two in polynomial time, unless 
NP=P. □ 



2.2 An approximation algorithm 

By Ki(s,t) we denote the connectivity of s and t in graph Gi, i.e., the maxi- 
mum number of disjoint paths between them. When the subscript is omitted, 
k(s, t) denotes the maximum number of disjoint paths of uni-color. We show the 
following greedy algorithm is a c-approximation algorithm for c-colors graphs. 

For each color i, find Ki(s,t). Select the color i with maximum Ki(s,t) 
and put these paths into solution. Remove all internal nodes of these 
paths, and then repeat the previous step until no path remains. 

Theorem 2. The Max CDP problem can be c- approximated in 0(mn) time for 
c-colors graphs. 

Proof. Apparently the optimal solution re(s, t) < c • max^ fc,(s, t). The approxi- 
mation ratio follows from that the number of paths found by the algorithm is 
at least max^ ftj(s, t). The value Ki(s,t), i.e., connectivity in a uni-color graph, 
can be found by solving a maximum flow problem [9l p. 212] and therefore 
takes 0(Ki(s,t)\Ei\) time |l|2j . In total the algorithm takes 0(K(s,t)m) time, or 
0{mn) time since K{s,t) < n. □ 

Figure [5] illustrates a tight example of the c-approximation algorithm. The 
optimal solution contains c disjoint paths (the horizontal ones), one for each 
color. But if we choose the bold path of color 1 at the first iteration, the algorithm 
will find only one path. 



2.3 An exact algorithm 



First, for any color i, if (s, t) £ Ei, this path of single edge must be in the optimal 
solution, and we can put it into the solution and remove this edge. Therefore, 
in the remaining paragraphs of this paper, we assume (s,t) ^ Ei for any i. For 
a c-colors graph G, define a node coloring S : V — {s, t} i-> {l..c}. Two nodes 
are said to be assigned the same color i if S(u) = 8(v) = i. For the convenience, 
nodes s and t are thought of having the same color as any node in any coloring. 
Let Ei[S], 1 < i < c, denote the subset of Ei in which the two endpoints are 
assigned the same color i by 8. Let E[5] = \JiEi[S] an d be the uni-color 
graph induced by the edge set E[S\. Suppose that V is an optimal solution of 
the Max CDP problem. Let 8* be a node coloring such that S*(v) — i if v is on 
a path of color i in V; and S*(v) is arbitrary otherwise. 

We can observe that any path in V must also be a path in G[S*} and any 
path in G[S*] corresponds to a uni-color path in G. Thus, \V\ equals the si- 
connectivities on G[S*} and can be computed in 0(mn) time. If we individually 
solve the maximum flow problems for all colorings, the total time complexity 
will be 0{mnc n ). By the following observations, the complexity can be reduced 
to 0((m + n)c n ). Using the generalized Gray code, all the c" colorings can be ar- 
ranged in an order 81,82, ■■ ■ such that two consecutive colorings differ at only one 
node, and thus the maximum flow corresponding to G[$i+i] can be obtained from 
that corresponding to G[Si] by performing at most two breadth-first-searches on 
the residual graph. The next theorem states the result but the detailed proof is 
omitted here. 

Theorem 3. There exists an 0((m + n)c n ) time algorithm for the Max CDP 
problem on c-colors graphs. 

3 Length-bounded cases 

In this section we discuss the Max CDP problem with bounded length. The 
length of a path is the number of edges in this path. When the path lengths 
are required to be upper bounded by a fixed integer /, we name the problem by 
Z-LCDP. An edge (u, v) e Ei will be denoted by (u, v; i), and {v\, V2, ■ ■ ■ , v m ; i) 
denote a path of color i and visiting v\,V2, ■ ■ ■ ,v m in this order. The cases of 
I < 2 can be easily solved, and we shall discuss the cases of I = 3 and 4. 

3.1 A polynomial time algorithm for 3-LCDP 

The set of all common neighbors of nodes s and t of color i is denoted by iVJ t . 
Recall that we have assumed (s, t) ^ Ei for all i, and we need only consider paths 
of length at least 2. An st-path of length two has the form (s,v,t;i), i.e., any 
co-neighbor of s and t may contribute a path. The next claim comes from that 
any si-path of length two is disjoint to any others of length 2 and may intersect 
at most one si-path of longer length. 



Claim. If v £ iV] t for any i, there is an optimal solution of the 3-LCDP problem 
containing the path (s, v, t; i). 



Algorithm 1 

Input: A c-colors graph G and two nodes s and t. 

Output: The maximum number of disjoint uni-color st-paths of length at most 3. 
1: S 0; > solution set 

2: for k 1 to c do 

3: for each node v £ Nl t , add path (s, v, t; i) into S and remove v from G; 
4: end for 

5: Fi <— {< it, v > |{(s, it), (it, v), (i>,t)} C for 1 < i < c; > ordered pairs 

6: F <— (J. i^i and construct the directed graph iif induced by F; 
7: find a maximum match M of H; 
8: for all < u,v >£ M do 

9: add (s, u, w, t; i) into S for some i such that (u, w) £ Fi] 
10: end for 
11: return S. 



Algorithm 1 is the proposed method for solving the 3-LCDP problem exactly. 
Besides the above claim, the correctness of the algorithm is due to the next claim 
which can be shown by observing that a set of disjoint st-paths corresponds to 
a matching on H, and vice versa. We remind that defining F as a set of ordered 
pairs is only for the sake of making step [9] easier. The maximum matching on a 
directed graph is the same as the one on an undirected graph. 

Claim. Suppose that |L iVJ t = 0. A maximum matching M of the graph H 
constructed in Algorithm [1] corresponds to an optimal solution of the 3-LCDP 
problem. 

The time complexity is dominated by the step of finding a maximum cardinality 
matching of a general graph, which can be done in 0(y/nm) time [10] . 

Theorem 4. The 3-LCDP problem on color graphs can be exactly solved in 
0{\fnm) time. 

3.2 The complexity of 4-LCDP and an approximation algorithm 

For the length-bounded case, the notations n\{s,t) and K l (s,t) are analogous to 
the ones without superscript but those paths are of length at most I. 

Theorem 5. The l-LCDP problem on c-colors graphs is NP-hard for fixed I > 4 
and c > 2. 

Proof. It is sufficient to show the case of I — 4 and c = 2. We show the NP- 
hardness by transforming from a restrict version of the SAT problem in which 
there are at most 3 occurrences of each variable. This version of SAT problem still 
remains NP-complete |4|15j . Let d, 1 < i < q, be the clauses and Xj, 1 < j < r, 




Fig. 3. Transformation from SAT problem to 4-LCDP problem: (a), edges of 
color 1; and (b). edges of color 2. 



the variables. For any variable Xi, if all the occurrences of Xi are positive, we 
can assign Xi True and remove Xi from all clauses. The case of all occurrences 
are negative is similar. Therefore we can assume the occurrences of each variable 
are neither all positive nor all negative. As a result, both Xi and Xi occur at 
most twice for any 1 < i < r. Given an instance of the restrict SAT problem, 
we construct a 2-colors graph as in Figure [3] Since the number of occurrences of 
each literal is at most two, any si-path of any color has length at most 4. Since 
both the degree of s and t are q, nf(s,t) < q, and the maximum is achieved if 
for any clause there is a literal not used in Gi. On the other hand, «;f(s,t) < r 
since the degree of s is r. We can also easily find r disjoint si-paths in Gi as long 
as for each i we use either Xi or X^ RS the internal nodes. If the SAT instance is 
satisfiable, let T be a truth assignment satisfying all the clauses. We choose Xi 
as internal nodes in Gi if Xi is assigned False in T; and Xi otherwise. Then we 
can have q disjoint si-paths of color 1 since there exists a literal assigned True 
in each clause and thus not used in color 2. The total number of disjoint paths is 
q + r. Conversely if there are q + r disjoint si-paths, there are exactly q paths in 
Gi and r paths in G2 ■ Therefore for each variable either itself or its negation is 
used in G2. Since there are q disjoint paths in Gi, each clause contains at least 
one literal not used in Gi- So we can assign Xi True if it is not used in G2 and 
False otherwise, and all the clauses are satisfied. □ 



The c-approximation algorithm in Section [2.2l also works for length bounded 
case. We may achieve a better approximation ratio for small /. 

Theorem 6. For any fixed e > and I > 3, the l-LCDP problem can be ap- 
proximated with ratio (I — l)/2 + e in polynomial time. 

To show Theorem we introduce the following problem, and Algorithm [5] is 
a (fc/2 + ^-approximation algorithm shown in [8]. 

Problem: The Maximum Set Packing (MSP) problem 

Instance: A collection T of fc-element subsets T l7 1 < i < p, of a 

universal set U of total q elements. 

Goal: A maximum disjoint sub-collection of T . 



Algorithm 2 

Input: An instance T of the MSP problem and an integer parameter s > 1. 
Output: A disjoint sub-collection of T. 

S <— 0; > solution set 

while 3 i + 1 disjoint subsets intersecting at most i subsets in S for any i < s do 
replace the i subsets in 5* with the new i + 1 subsets; 

end while 

Output 5". 



Let OPT denote the maximum number of disjoint subsets and APP denote the 
result obtained by Algorithm [2j It was shown in [8] that 



By transforming to the MSP problem, the Z-CDP problem can be approxi- 
mated with ratio (l — l)/2 + e for any e > 0. A direct transformation is as follows. 
Let (G, s, t) be an instance of the ^-LCDP problem. 

— For each uni-color si-path of length at most I, create a subset Ti consisting 
of the internal nodes of the path. There are at most 0(n /_1 ) subsets and 
\Ti\ < I- 1 for each %. 

— The elements are all the nodes in the graph except s and t. 

— Any disjoint sub-collection corresponds to a set of disjoint uni-color paths. 

The stop condition of the while-loop can be implemented by enumerating all 
possible i + 1 subsets, testing if they are disjoint in 0((i + l) 2 l) time, and counting 
the intersected subsets in S in 0((i + l)\S\l) time. Since I is fixed, i < s, and 
s is also a constant determined by e, this step takes 0(|T| l+1 x \S\). Since 
| S | is increased at least one after each iteration and bounded by 0(n), the 
naive implementation has time complexity 0(|T| S+1 x IS*! 2 ) = 0(n( l ~ 1 ^ s+1 * >+2 ), 
which is polynomial for fixed I and e. Theorem [6] follows from Eq. (JTJ), the 
transformation and the above analysis of the time complexity. 




(1) 



3.3 An efficient 2-approximation algorithm for 4-LDCP 



Particularly, when I — 4 and s = 1, by substituting k = I — 1 = 3, the approx- 
imation ratio by Eq. JTJ is (3 X 2 - 2)/(2 2 - 2) = 2. That is, it takes 0(n 8 ) 
time to compute a 2-approximation of the 4-LDCP problem. Although in poly- 
nomial time, it becomes intractable even for graphs of moderate size. In the 
following, we aim at developing a more efficient algorithm for s = 1 and I = 4. 
Let S = {Ti\l < i < \S\} denote the solution found so far, in which Tj is the set 
of internal nodes of an si-path. Let Vq = V — (J. Ti be the nodes not used yet. 
When s = 1, the while-condition can be implemented by 

For each Ti € 5 1 , determine if there are two disjoint si-paths of length at 
most 4 in G[V U T;]. 

The key point is how to determine if K 4 (s,i) > 2 in a color graph without 
generating all possible paths. We shall use the following notations. The distance, 
or shortest path length, between s and t in graph Gi is denoted by di(s,t). A 
node v is an st-cut node in graph Gi if its removal separates the two nodes, i.e., 
Ki(s, t) = after removing v. The set of all such cut nodes is denoted by Xi. 



Algorithm 3 

Input: A color graph G and two nodes s and t. 

Output: Return True iff there are two disjoint paths of length at most four. 
1: for each color i do 

2: remove any node v in Gi such that dj(s, v) + di(v, t) > 4. 
3: end for 

4: if K%(s,t) = 2 for some i then 
5: return True; 
6: end if 

7: for all i and j such that /tf(s,i) = K j( s >£) — 1 do 

8: if Test(i, j) =True then 

9: return True; 

10: end if 
11: end for 
12: return False. 

1: procedure Test(? ,j) > testing if there are two disjoint paths in Gj and 
Gj, resp, of length at most four. It is ensured that X q (s,t) ^ 0, for q = i,j 
and d q (s, v) + d q (v, t) < 4 for any node v in Gj or Gj. 

2: repeat 

3: Gi i Gi Xj ; Gj 4 Gj Xi', 

4: until both Gi and Gj are unchanged or d q (s, t) > 4 for q = i or j; 
5: if di(s, t) > 4 or dj(s, t) > 4 or X t n Xj ^ then 
6: return False; 

7: else if di(s,t) < 3 or dj(s,t) < 3 then 
8: return True; 

9: end if 



10: if more than two si-paths of length 4 in Gi or in Gj then 
11: return True; 

12: else > at most two length-4 paths in Gi and in Gj. 

13: determine and return the result by a brute force method; 

14: end if 

15: end procedure 

Lemma 2. Algorithm^ is correct and takes 0(c 2 n + cm) time. 

Proof. The algorithm returns True iff nf(s,t) > 2 for some color i or there are 
two uni-color disjoint paths of two colors. Clearly, what we need to show is the 
correctness of the procedure Test. 

By the assumption that {s,t) ^ E{ for all i, we need not consider the case 
that di(s,t) — 1 or dj(s,t) = 1. The test procedure starts with a repeat-until 
loop to remove any si-cut node of one graph from the other. Note that the loop 
is necessary since removing nodes from a graph may result in new cut nodes. 
But the loop will only be executed at most four times since each graph has one 
si-cut node originally and can have at most three si-cut nodes or otherwise s 
and t will have distance more than 4 (including oo, i.e., disconnected). 

Step [5] deals with the case that the distance between s and t in either graph 
exceeds 4 or there exists any common si-cut node. At the beginning of step 
we have that Xi n X, = and the distance between s and t at either graph is 
at least two. Let x e JQ. If di(s,t) — 2, there exists a (unique) si-path (s,x,t) 
of color i. Immediately the output should be True since dj(s,t) < 4 and x is 
not in Gj. The case that dj(s,t) = 2 is similar. If di(s,t) = 3, there is a path 
(s, y, x, t) or (s, x, y, t) in Gi. Since x is not in Gj and y Xj, recalling that we 
have removed any si-cut node of Gj from Gi, the result should also be True. 

The remaining case is di(s,t) = dj(s,t) = 4. Recall that each graph has at 
least one si-cut node. Any length-4 si-path in Gi contains exactly three internal 
nodes, said {x, j/i, 2/2}, in which x € X j and therefore not in Gj. Furthermore 
neither y\ nor j/2 is in Xj. Hence, removing the three nodes destroys at most two 
paths in Gj. If there are more than two, not disjoint surely, length-4 si-paths 
in Gj, the output should be True. Similarly it holds if there are more than two 
such paths in Gi. The remaining case is that there are one or two paths in either 
graph, and the answer can be obtained by the following method. First we choose 
a path in Gi and check if the removal of the internal nodes separates s and i in 
Gj. If not, we find two disjoint paths. Otherwise we choose the other path in Gi 
if any, and do it again. 

By the above discussion, the test procedure takes linear time, i.e., 0(|V| + 
I Ei I + 1 Ej I ) . The whole algorithm calls the test procedure for each pair of i and j , 
and therefore the total time complexity is 0(c 2 |V| + 2c ^ \Ei\) = 0(c 2 n + cm) 
since the other steps of Algorithm [3] can be done in 0(cn + m) time. □ 

Combining Algorithms [5] and [3J we obtain the next theorem. The time com- 
plexity is obtained as follows. To implement the while-condition of Algorithms [21 
we need to call Algorithm [3J at most \S\ times, where \S\ is the number of 
paths found so far. Let p be the number of paths found by the algorithm. Since 



the while-loop may be executed at most p times, the total time complexity is 
0(p 2 (c 2 n + cm)). 

Theorem 7. There exists an 0(p 2 (c 2 n + cm)) time 2- approximation algorithm 
for the 4-LCDP problem on a c-colors graph, in which p is the number of paths 
found by the algorithm. 

Finally we would like to remark the following. In most of the applications, 
both c and p are small integers, and thus the approximation algorithm runs 
in linear time. Furthermore, since we need only consider the graphs induced 
by {v\di(s,v) + di(v,t) < 4} for each color i, the algorithm is in fact a local 
algorithm and is therefore efficient even for large-scale social networks. 
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